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RICE ACTIN GENE AND PROMOTER 

Significant progress has recently been made in the 
application of tissue culture and gene transfer techniques to 
5 previously recalcitrant monocotyledonous plants, such as rice. 
Rice protoplasts have been shown to transiently express a foreign 
gene after transformation by electroporation, and stable genomic 
integration of foreign DNA, following polyethylene-glycol- 
mediated transformation of rice cells, has been previously 
1 0 demonstrated. 

One of the major limitations in rice transformation 
technology has been the lack of an efficient promoter for high 
level constitutive expression of foreign genes in transgenic 
plants. The cauliflower mosaic virus (CaMV) 35S promoter has 

1 5 been widely used for this purpose in a number of plant 

transformation systems, however, the CaMV 35S promoter has 
shown low activity in transforming rice cells, and recent reports 
suggest that the pattern of CaMV 35S promoter activity in 
transgenic plants may not be constitutive. Work in our laboratory 

2 0 suggests that the 5' region of the maize Adh1 gene containing the 

promoter, 5'-coding exon and 5'-intron, is 20 to 30 times more 
active than the CaMV 35S promoter for transient gene expression 
in transformed rice protoplasts and call}, however, the maximum 
activity of AdH1 promoter in transformed rice protoplasts 
2 5 requires anaerobic stress and its activity is not constitutive in 
all transformed rice tissues. As a result, a number of alternative 
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promoter elements for rice transformation are currently under 
investigation. 

It is, therefore, the purpose of the present invention to 
describe an efficient promoter for rice transformation, and in 
5 doing so have based such a promoter upon a rice actin gene that 
displays a constitutive spatial and temporal pattern of transcript 
abundance. 

Cytoplasmic actin is a fundamental component of the 
eukaryotic cell cytoskeleton. In higher-plant cells a number of 

0 cellular process, such as cytoplasmic streaming, extension 
growth and cell division, are believed to involve cytoskeletal 
actin protein. Actin has been found to be encoded by gene 
families in all higher plants studied to date. A unique feature of 
these plant actin genes, in contrast to the situation for animal 

5 actin genes, has been their conservation of gene structure. All of 
the studied plant actin genes are reported to consist of four 
coding exons of conserved length, separated by three introns of 
variable length. In each of these cases the coding regions of the 
plant actin genes have been deduced following a comparison of 

0 their potential translations products with that of previously 
published animal actin protein sequences. However, in a number 
of animal actin genes there exists a 5' transcribed, but 
untranslated, sequence (5'-noncoding exon), separated by an intron 
(5'-intron) from the exon containing the first translated codon 

5 (5'-coding exon). Although it has been suggested that a 5- 

noncoding exon may exist in plant actin genes, such an exon would 
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fail to be detected in any comparison of the coding region of plant 
and animal actin genes. 

In rice, there are at least 8 actin-like sequences per haploid 
genome, of which four have been isolated and shown to differ 
5 from each other in the tissue and stage specific abundance of 
their respective transcripts. One rice actin gene, RAct, was 
found to encode a transcipt that is relatively abundant in all rice 
tissues, and at all developmental stages examined. In view of the 
high level constitutive abundance of the rice RAc1 transcript the 

10 present invention describes the isolation and partial sequencing 
of a genomic clone containing the rice RAct 5' flanking and 5- 
transcribed, non-translated, regions. From such clones, a number 
of RAc1-GUS (GUS stands for B-glucuromidase gene) fusions 
plasmids were constructed and used in transient expression 

15 assays of transformed rice protoplasts. The results from these 
assays suggest that the regulatory element(s) necessary for 
maximal RAc1 promoter activity in transformed rice protoplasts 
are located within a region 1 .3 kb upstream of the RAc1 
translation initiation codon. The rice RAc1 promoter is 5 times 

2 0 more active than the maize Adhl promoter in transformed rice 
cells, suggesting that the RAc1 5* region contains an efficient 
promoter for rice transformation. 

As a first step towards a characterization of the actin gene 
in rice, several clones from a rice genomic library representing 

2 5 four unique actin sequences were isolated. A rice (Oryza sativa 
variety IR26) genomic library in lambda-EMBL4 was screened 
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with a heterologous actin probe. Fourteen independent clones 
were isolated and subcloned into pUC13. By mapping restriction 
sited and carrying out cross-hybridization studies, four different 
classes of clones were identified and designated RAc1 , RAc2, 
5 RAc3 and RAc7. 

Sequencing of the subcloned actin genes was carried out by 
the dideoxynucleotide chain termination method and computer 
analysts of the resulting sequences was done. The nucleotide 
sequences of the coding regions of the four actin genes appear in 
1 0 the EMBL Gene Bank and DDB7 Nucleotide Sequence Databases 

under the accession numbers X15865 RAc1; X15864 RAc2; X15862 
RAc3; and X15863 RAc7. 

Prior to identifying those sequences which regulate RAc1 
expression, its complete genomic structure was determined. By 

1 5 characterizing an RAc1 cDNA clone (EMBL data bank accession 

number X16280) and 5* end mapping the RAc1 transcript, the 
structure of the RAc1 gene was determined and the position of a 
5'-noncoding exon in its genomic sequence was identified and 
located. This represents the first complete structural 

2 0 characterization of plant actin gene and is one of few reported 

cases of a S'-noncoding exon in a plant gene. 

A complete understanding of all aspects of the present 
invention can be better understood from reference to the 
accompanying figures (and examples) in which: 
25 Figure 1a is a restriction map of the pcRAc1.3 cDNA insert, 

according to the present invention; 
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Figure 1b is a restriction map and proposed structure of the 
RAc1 genomic clone according the the present invention; 

Figure 2a is a restriction map of the 15.1 Kb iambda-RAd 
insert, according to the present invention; 
5 Figure 2b is a restriction map of the pRAc 15\H3 insert 

according to the present invention; 

Figure 3 depicts maps of constructs according to the 
present invention. 

The identification of the complete genomic structure was 
1 0 accomplished utilizing the protocols described in the following 
examples: 



WO 91/09948 



PCT/US91/00073 



Example 1 

a. Screening the Iambda-gt11 c DNA library: 

A rice (Oryza sativa IR36) Iambda-gt11 cDNA library, 
5 prepared from six-day-old etiolated leaf tissue was plated and 
lifted onto nylon filters. Primary and secondary, screenings for 
rice actin cDNA clones were carried out by hybridization with a 
260 bp, 32p_dATP labeled BamH1-Hindlll restriction fragment 
from the rice actin genomic clone pRAc2. Tertiary screening was 
10 carried out using rice actin gene- specific probes. RAc1-like 
cDNA clones were identified using a 900-bp BamH1-Hindlll 
restriction fragment probe from the 3' end of the rice actin 
genomic clone pRAd. 

15 b. Restriction mapping and nucleotide sequencing: 

Actin positive clones from tertiary screening of the 
Iambda-gt11 cDNA library were further characterized by 
restriction mapping. Restriction fragments from the cDNA insert 
of the RAc1 positive clone pcRAd .3 were subcloned into pBS-KS 

2 0 vectors for nucleotide sequencing. Double-stranded DNA 
sequencing using 35S-dATP W as carried out using a T7 
polymerase sequencing kit following the manufacturer's 
(Pharmacia) modification of the Sanger et al dideoxynucleotide 
sequencing procedure. The 5' and 3* ends of the pcRAcl.3 cDNA 

25 insert were further sequenced using two synthetic 

oligonucleotides; 5' -AAGCGGCCTCTACTACGC-3' and 5*- 
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GAAG CATTTC CTG RGC AC A AT-3* respectively and subsequently the 
sequence data were analyzed. 

c. 5'-end mapping of RAcI transcript by northern blot analysis 
5 and primer extension: 

RNA isolation from seven-day-old rice shoots and Northern 
blotting were performed. 10 u.g of total RNA samples were used in 
formaldehyde agarose gel electrophoresis and northern analysis 
was carried out under stringent hybridization conditioned (50% 
10 formamide, 50°C). Restriction fragment probes for northern 

hybridization were isolated from the 5'-untranslated end of the 
rice actin genomic clone pRAd and subcloned into pBS-KS 
vectors. 

The primer extension analysis was performed using 1 |iQ of 
1 5 a synthetic oligonucleotide primer S'-CTTCTACCGGCGGCGGC-S' 
which was annealed to 25 u.g of total RNA from seven-day-old 
rice shoots. 
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The cDNA library made with mRNA from six-day-old 
etiolated rice shoots according to Example 1 provided the cDNA 
clones corresponding to the multiple members of the rice actin 
gene family. Primary and secondary screening with a 260 bp Bam 
5 HI-H/nd III restriction fragment - a fragment previously known to 
cross-hybridize with many members of the rice actin gene family 
- from the rice actin gene RAc2, because of the homology found 
among isolated rice actin genomic sequenced. Thirty four clones 
were identified which strongly hybridized to the non-specific 

1 0 RAc2 actin probe. 

Tertiary screening of the 34 actin-positive cloned was 
carried out using probes previously determined to be actin gene 
specific. Seven RAc1-like cDNA clones were identified which 
hybridized specifically to a 900 bp BamH1 -Hindlll restriction 
15 fragment from pRAd. The nature of these clones was confirmed 
by restriction mapping and Southern blotting. The EcoRI insert 
from the lambda-gt 11 clone lambda- RAc1.3 was subcloned into a 
pBS-KS vector to produce the plasm id pcRAc1.3. 

Using similar procedures, nine RAc3-like cDNA clones and 

2 0 eight RAc7-like cDNA clones were identified. No cDNA clones 

were identified which cross-hybridized with a RAc2 gene specific 
probe. The remaining ten actin-positive clones failed to cross- 
hybridize with any of the previously characterized actin gene- 
specific probes. These were classified into five groups on the 
25 basis of restriction mapping and cross-hybridization analysis. 
The isolation of cDNA clones representing eight distinct actin 
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transcripts confirms that the actin gene family in rice composed 
of at least eight unique members. 

The insert from pcRA1.3 was further subcloned into pBS-KS 
plasmids. Double stranded DNA was prepared for sequencing. The 
5 restriction enzyme map of the pcRACI .3 insert and the sequencing 
strategy for determining its base are summarized in Fig. la. 

More specifically, Fig. 1 describes the determination of the 
RAc1 gene structure by alignment of the pcRAc1.3 cDNA insert 
with the pRAd genomic clone a: restriction map of the pcRAcI 

10 cDNA insert with the strategy used to determine its base 
sequence. The horizontal arrows indicate the direction of 
sequencing and their length indicate the actual size of the 
sequence obtained. Horizontal arrows preceded with a sinusoidal 
wave indicate the use of synthetic oligonucleotide primers, b: 

15 restriction map and proposed structure of the RAc1 genomic 
clone. Horizontal lines represent introns and boxes represent 
exons. Open boxes represent the transcribed but untranslated 
regions of exons, closed boxes represent the translated regions of 
exons. Restrictions sites: B, BamH1 ; H, Hindlll ; R, EcoRV; S, Ssti. 

2 0 Dashed lined between the cDNA and genomic DNA restriction maps 
indicate the alignment of their homologous exons. 

The tabulated sequence which depicts the nucleotide 
sequence of the rice actin gene RAc1, is depicted in the following 
sequence of 551 0 nucleotides 



25 
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1 GATATCCCTC A3CCGCCTTT 

TCATTGTCAT GfrGAACCTTG 
8 1 GCGTCGATTT TCCTCTTATA 

TGATCGCGTC TGATTGGCTA 
161 TGGATGTATT TTCATACATA 

TCATTACACT TTATTAGAGG 
241 CACTATGTCT GGTGTTTCAT 

ATTGACTTCT GATTCAATAT 
321 GCAGTACTAA TTCTGGTTGA 

TCAGCTTCAC TCTATTAGGC 
401 ACGTACGAAT ACTCTATCAA 

TGAGACATGT ACTATAGATT 

4 81 TGTATGTATC ACGTCTAATT 

ATA 1 1 1 1 1 ( A ATAACATTAA 

5 61 ATGGTGTACG ATAACCACAA 

TTGCCTTACG CTGAAGTTAC 
641 ATTTTACTTT GACCACCCTT 

TGGAAGGCTC CAGCCGTAAT 
721 CCTTAGCCGG ATTATATTGC 

ATCTGCAAGA ATGTCTACTC 
801 AGCTAGCATA CTOGAGGTCA 

GTCGGGATAG TCGAAAATAA 
881 GTCAAAAGTG AAAACATCAG 

AATATCGGTA ATAAAAGGTG 
961 TTTTCTACTA TTATAAAAAT 

TTTGATACGT CATTTTTGTA 
1041 TTCGCGATTT TGGAAATGCA 

TTTAAGTTCG TTTGCTTTTG 
1121 TAAGAAATAT CTTTAAAAAA 

TAATTTTTGA GAAAAATATA 
1201 ATGAAGAATA ATAAGATTAA 

GCGATGGGTA 1 1 1 1 1 IC TAG 
1281 GACTCAAAAC ATTTACAAAA 

GXCAAAGTG CTATGO«£G<\ 



CACTATCTTT 

GCATGTATAA 

GGTCGGXAA 

GAGATATGTT 

ATCATATGCA 

7GGTGAGTAA 

TTTATTTGCT 

TTAAGGATOG 

CGGAACTATA 

ACTACTTGAC 

TTAGTCTGAG 

ACTATTGTAC 

G^ACTACACA 

AACCTACCTC 

GTTTA3G4G3 

GCCTTAAAAA 

CAATGTTGAC 

A3GATTCTGC 

CIQCCCACTT 

GCTAGAATTA 

TTCATATGCT 

AACAAAGGTA 

TTAAAAGGTG 

GDOCAAAGTG 

TG4GGATGTT 

TGAATTGGI I 

TATCTGTATT 

TAAATACAGA 

AOOCATATGC 

TATTCAGQOG 

AATAQCTTGC 

TAAAATAAAA 

ACAACCCCTA 

TCGATAGCAA 



TTTCDOCGAG 

TCGGTGAATT 

TGAATCCGTG 

TCTTCCTTGT 

TACAAATATT 

TAAACCCTAT 

TTTAAACGAA 

GAATOQGOGT 

CGTAAACTAT 

A2AO0GG^rr 

TCTAACTTGT 

CTTGACTCAC 

TATATACGCG 

TATGTCAACA 

TAAAAAAACA 

TAAAGAGTAA 

TTTAGAOOQG 

AOCTCACATG 

TCTCACTCAT 

TOGOGATAGT 

TGAGAAGAGA 

AGATTACCTG 

GTATAAGTAA 

AAATTTACTC 

TTGTCGGTAC 

TTTAAGTTTA 

TGAGTOG33T 

GGGATTTGTA 

TAATTTGACA 

AATTCTGACA 

OXOGTTQCA 

GATAAACTTA 

AAGTCCTAAA 

GXGflGCCCA 
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1361 ACCCA4C0CA. 

AAATASTCTC 
1441 CGCfiCCPCOG 

A3AAAAAAAA 
1521 TOGIQGGGGC 

CGAGGXQ30 
1601 ATCGCCACTA 

C0CA4CCCTA 
1681 TOGCIG00GG 

coooaooGsrr 

1761 CCG 1 1 1 1 1 1 1 

TAGTTTQGGn" 
1841 TGOGOGGGAG 

OGTGAGTOGG 
1921 GATCTAGA7C 

TTGAATCCCT 
2001 ATGATTTGTG 

TAQSTAG^AG 
2081 GTCTQOGATA 

TQGATCTCA3 
2161 TCTTQGTAQC 

TOGDOQGflGA 

2321 AGAQGGGTAT 

TATCGTCAGC 
2401 CAGACCTTCT 

ACXXXX3TOCT 
2481 GAATOGTGOG 

AAGACCCCTG 
2561 CICTGT A IGC 
' ACACTQAACT 
2641 ACATTCACAT 

TTGTCATACA 



AO0GAAO0CA 
GACAC0CCG3 
GAGGTCTOGC 
GAAAAAGAAA 
CGGAAAA90G 
CCTO0CTO0G 
TATACATACC 
CCflCCfC&C 
/CGAOGAGCT 
AACGAOOOOG 
TTTCGTCTCG 
GQGCG a G fl G3 
GQQOQQGMC 
COOGGATOCT 
TTCTTTCTTT 
CAGCATTGTT 
ACAAATQCAG 
ATGGCIGAOG 
ATOGAACTO3 
GGTGGTTTCC 
AAAACTGAQ3 
TGATGOGXC 
CCTOGOGACA 
ACGCCTACGT 
CTTGACOCTC 
AACTGGGATG 
AGAACGAGCT 
OCTCACOGAG 
AAGATGACCC 
CTATGTACGT 
CAGTGGTOGT 
AAAAGQCTGT 
GTAGATGAGA 
GCAAGACTAT 



COOGAGTOCA 
CACTATCACC 
AGOCAAAAAA 
AAACAGCAGG 
A33AG3ATOG 
CTTCCAAAGA 
CCCOCCTCIC 
GACCACGACC 
CCTCC000CT 
CCCGTCTCCT 
G7CTCGATCX 
GQCTTOGTOG 
TOG0G3CIG3 
G30GG3GAAT 
CTTCTTTTTG 
CATOGGTAGT 
CCTOGTOOGG 
CCGAGGATAT 
TATGGTCAAG 
GTTTACGGAA 
TGGI I IGIGT 
AGQQCTGTCT 
CCGGTGTCAT 
OGGO6A0GA3 
AAGTAOCCGA 
ATATGGAGAA 
C0GIG1QQDC 
GOIGCTCTCA 
AGATCATGTT 
CX3GGATOCAG 
A00GAAG31G 
GAGGATGAAT 
TTTAGTTGTG 
ATAATAGCTT 



G0CAACTG3C 

GTGAGI IGTC 

AAAAAAA3AA 

TGQGIC0G33 

OGAGCAGDGA 

AAOGXOOCC 

CTCOGATCCC 

TDJTOOOCCC 

C000CTO0QC 

CTTTCTTTCT 

TTQQOCTTGG 

(XEAGATCGG 

OGTCTCOGQG 

GGG3CTUTOG 

TQQGTAGAAT 

TTTTCTTTTC 

AGGI 1 1 1 1 IG 

CCAGCOOCTC 

GTAAGCTGTT 

ATGCTGCATT 

GAGGCTQQ3T 

TOCCGAGGAT 

GGTCGGAATG 

GCGGAGTCCA 

TOGAGGATQG 

GATCTGGCAT 

00GGAG3AGC 

A0GCGAAG3O 

TGAGAOCTTC 

GOOGIUJIUI 

AGGACATTOG 

TTTAATTTTG 

CAATCTTCAA 

TCAAAATAAA 
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25 



30 



2721 
2801 
2881 
2961 
3041 
3121 
3201 
3281 
3361 
3441 
3521 
3601 
3681 
3761 
3841 
3921 
4001 



ATCATAGGCA 
CCTAATTUTG 
GTGCATCAGT 
ATTGTGTTGG 
GCATCTATGA 
JCTCGfiCCTT 
AAGATCCTGA 
O0GAG0Q3GA 

ctacatcgcc 
aag4cca3ct 
atqgacaggt 

CCCTGAGGTC 
GCIQOGGGTA 
AGTGOG4CGT 
CGI IOICAGT 
G4CAG3ATG&. 

gcatgaag4t 
ca3r3ictcg 
acattccaqc 
ttgtttacct 
gcttgccgtt 

CGAGTCIQ3C 
TTCTTOGGAC 
TCGOOGfTOCT 
TCTTCATTAG 
TTATCATTTT 
TACCTGTGGT 
GGTTTATTCA 
GAAAAAATAC 
TCTATGAAAT 
TTCQCATGAA 
ATCTAGTGCA 
CGTAACT7CC 
ACATGTCACA 



GilClCATAA 
TTGQCATG3A 
GTGAATAACT 
ACTCTGGTGA 
A3GATATGCT 
QCTQ3GDGTG 
0GQAG0GIG3 

aattgtgagg 
ctqgactatg 
cctoogtosa 
tatcaccatt 
ctu1 igcagc 
tccatgagac 
ggajattagg 

GGTGGTACCA 
GCAAG3AAGA 
CAAG3TG3TC 
ATTCGAGGAT 
AGGTAAATAT 
CATGAACTTG 
GCAGATGIGG 
CCATCCATTC 
CCAAGAATGC 
CCTGCI IGI I 
CGTGGACAAA 
CTTCTATTCA 
TCTCACTTGT 
AATTTAATCA 
GTGTTCTGGT 
GAATAACATC 
TGI IGI 1 1 IC 
TGQGAGI I IG 
ACTTTGCCTT 
GACACTTCTG 



ATGGAATCAT 
GIGG1 1 IGAC 
GAATTTCCTT 
TGGTGICAGC 
CTCCCCCATC 
ATGTCACTGA 
TTACTCATTC 
GACATGAAGG 
ACCAGGAAAT 
GAAGAGCTAC 
GGIGCTGAGC 
CTTCCTTCAT 
TACATACAAC 
AAGGATCTAT 
CTATGTTCCC 
TCACTGOCTT 
GCCCCIGGIG 
(XATUITGGC 
ACAAATCCAG 
ATCAATTTGC 
ATTGOCAAGG 
TGCACAGGAA 
TAAGCCAAGA 
TCTCTCTTTT 
GTTTTCAAOC 
AAGACTGTAA 
GATnTGGAC 
GATGCCTGAT 
TGI 1 1 1 IGAG 
GAAGTTATCA 
TTCTGTCTTG 
TATTGTGATG 
TGCTGTTCGA 
CTACTTTTCT 



GTTTGAACAT 
ATTTTGAAGT 
TTCCCAAGGT 
OiCACTGTCC 
CTATCCTTCG 
TTACCTGATG 
ACCACAACG3 
AGAAGCTTTC 
GGAG4CTGDC 
GAGC1 IOCTC 
GTTTCCGCTG 
AG3AATOGAA 
TCCATCATGA 
ATGQCAACAT 
TGGCATTGCT 
GCroCTAQCA 
AAAGGAAGTA 
ATCTCTCAGC 
CAATGTAGTG 
TTACAATGTT 
CTGAGTACGA 
ATGCTTCTAA 
GGAGCTGTTA 
TGTTGCTGTT 
GQOCTATCTG 
TACCTATTGC 
ACATATGTTC 
GAGG3TACCA 
TTGCGATTAT 
TCCCAGTATT 
TGCATCAGTG 
TTCGACATCA 
TATTTTAATG 
TTUTTGGCTA 
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4081 TTGTGCCAGO 

TATTCTCATC 
4161 03CTTAAAAG 

GTPGATAATCT 
4241 AAATGGAACC 

ACAAGAATAT 
4321 TTCGTGGCTT 

TTCAACATGT 
4401 7UTCCAAGGT 

CCTGOCAATT 
4481 GCTTTCTATG 

TGCAAQCTTT 
4561 ACTTCAATTT 

CATGCTTATA 
4641 CGI 1 1 IGACT 

TTGTTTTAAA 
4721 ATGGAGATGT 

TGQCTGTTTA 
4801 ACTATACTAA 

TACTTGAGOG 
4881 0GAAG0Q3CA 

ACATCTAGAA 
4961 TGTTACTTAG 

GAAGATAAOC 
5041 TCAATCTGTA 

GCTAAAAGTT 
5121 AGCAACCTGA 

COOGAGACTT 
5201 AOGGTGAGGT 

AGCGATTCTT 
5281 TACATCTQGT 

CTTAGTATAT 
5361 TGCAGCAGCT 

CCTGACTUTG 



ATGATGGAAG 
GTCAGGCTTT 
TTGTAOCGCC 
TTTCACTTTT 
AIGTGGATCA 
TTGGTAGAGT 
GGCTCOCTGT 
TAGDGTGTAG 
AAAATGTQGC 
TTTTAAGTIT 
GTTTGTGTAC 
CTATGGTTTG 
TGTTAGAAAT 
GTAAATCACT 
TGAGAAGATT 
ATTACTTTAT 
GATCACAAAT 
TATATTATCA 
CTGTTCATAT 
GGAAAAGGGT 
GCAAATOGAG 
AAGCACATTT 
AAGCAG3AAG 
AGQQDQQDAC 
TACACTCAAA 
TCXXnGTTTC 
AGAGTTTGTT 
TGATCCTTCC 
CACTGTACCG 
TGCI I CI 1 1 1 
AIQGTGCGTA 
GOCGQCAGAA 
GCACTTAQCT 
GATGTGQCTT 



ATGCATCAGA 
CAGQDAGAGA 
AGTAGACATC 
CTTAAAGAAA 
GAGAAGCTTT 
GQQG3TDCTA 
CTTCAACCAA 
AAAGAGGACA 
ATTCTGTTAA 
CAA7GGAAGA 
ACAAGACAGT 
TGTGCQOGTT 
GAAAGAAAAA 
CI 1 1 1 ICCTC 
TTAAAATCTG 
GTATTTCCAT 
TGTTACACCA 
GACCAGGOTA 
TATUTGGAAA 
ATAGATATGA 
GGICICTUTG 
AACCTTTTCT 
TTGATGCAAA 
TOGAAGAGTT 
GTTACTCGGA 
ATTTGAACCA 
GTGCAAAGGT 
CTTGCATATC 
CAAGCATTAG 
GTOCAOOGTT 
CATCAAGGGC 
QCTACGACAA 
ATATTGCCAG 
GAGTATGGTC 



OGATCAGATA 
GCAOQCTTTG 
CCC7GTAGAA 
TTGAGAGGQG 
TGTTTCTTAC 
TGTTOGTGGG 
GTGTTTTCAG 
ATTCTGTTTA 
AGAACATGAT 
GGAATGTAAA 
GGAAGAGGAG 
GTGGIGOAGO 
AAAGGATGAT 
GCCTTCTGTA 
TACATGACCT 
CTTTCAAGTT 
ATCAGGAGGC 
TATAGAGTAT 
TCTTGCTTGC 
GGTTAAGGAA 
AAATCATTTT 
AGAACACATA 
ATTTCATCGA 
ATCT7TTACC 
TTGTACATTG 
CCTCAGCAAA 
AAAAAOCnC 
TAAGGQCATC 
TCGAAGACAA 
TCAATATGTA 
CAAGAATACT 
CTCTCAAACT 
AAGTATCATA 
CI I IGIGACA 
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5441 CTATAGACAG CiAATCAACOC ATCATTGTCA AGflCTAGflGA 
TATATAATAG CCTAAAGATO CAATGAATOC 5510 

An alignment between the sequence of the RAc1 cDNA clone 
and that of the RAc1 genomic sequence was used to determine the 
5 structure of the RAc1 gene shown above. Translation of the 

pcRA1.3 insert in all three reading frames identified a potential 
coding region of 1131 nucleotides. The potential coding region of 
the RAc1 cDNA, if translated in vivo, would code for 377 amino 
acids and an actin protein of 41.9 kDA estimated molecular mass. 

1 0 This analysis identified three introns, interrupting the gene at 
the same places in the RAc1 coding sequence as those previously 
reported for all other plant actins. The analysis also identified 
an additional intron within the transcribed sequence of RAc1 
which is 5' of the region containing the translation initiation 

1 5 codon. This 5'-'mtron separated a 79 bp GC-rich 5'-noncoding exon 
from an exon coding the translation initiation codon. 

In the sequence depicted above, the promoter according to 
the present invention lies within 0 to 2071 nucleotides, more 
particularly, the efficiency of the promoter appears to lie within 

20 about 800 to 2071 nucleotides; nucleotides 811 to 816 and 2066 
to 2070 are the Xhol and EcoRV enzyme sites, respectively; 2051 
to 3600 nucleotides encompass the coding region DNA sequence 
for the rice actin gene RAc1; and 1650 to 3841 nucleotides is the 
RAc1 genomic clone for pRArf. 

25 To determine if any of the previously isolated plant actin 

genes also contain 5'-intron-Uke sequences, the region 5* of their 
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respective translation initiation codons to that of RAc1, and 
those animal actin genes known to contain such 5*-introns as 
compared. This analysis revealed that the soybean (Glycine max) 
actin genes SAc3 and SAc1, the Arabidopsis thaliana actin gene 
5 AAc1, the potato (Solanum tuberosum) actin processed 

pseudogene PAc-psi, and the maize (Zea mays) actin gene MAc1 
have regions upstream of their translation initiation codons 
which bear sequence similarity to the 3'-splice site junction 
regions of the 5'-introns of the rice actin gene RAc1 , a Xenopus 
1 0 borealis cytoplasmic actin gene and the Drosophila melanogaster 
cytoplasmic actin gene DmA2. These previously confirmed, and 
putative intron splice sited are all found within 7 to 11 bp 
upstream of their respective translation initiation codons. 
Although the short region of untranslated exons they would 

1 5 encode are all AT-rich, they do not as a group suggest any strong 

consensus sequence. However, these sequences may serve a 
similar function in the different actin genes. 

The complete structural analysis of the RAc1 gene 
described above has therefore led to the identification and 
20 localization of a 5' noncoding exon, separated by a 5'-intron from 
the first coding exon, in the RAc1 genomic sequence. It has been 
reported that a 5-intron in the maize Adh1 gene is essential for 
the efficient expression of foreign genes from the maize Adh1 
promoter [Callis et al., 'Introns increase gene expression in 

2 5 cultured maize cells. Genes & Development, 1:1183 (1987)]. In 

order to investigate the effect of the RAc1 5'-intron on gene 
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expression, a number of GUS fusion plasmids containing RAc1 
intron deletions which involved the successive removal of those 
intron sequence elements previously determined as being 
important for efficient intron splicing, such as the mRNA branch 

5 point and 3'-donor splice sites. The results of transient assays of 
GUS activity in rice protoplasts transformed with the various 
deletion constructs suggest that the S'-intron of RAc1 is 
essential for efficient gene expression from the RAcl promoter. 
Test results also suggest that the intron-mediated stimulation of 

0 gene expression is not a function of the intron sequence per se but 
is associated, in part, with an in vivo requirement for efficient 
intron splicing. 

This analysis and following discussion of the identification 
and localization of the 5' noncoding exon in the RAc1 genomic 

5 sequence can be more easily understood with references to the 
foil o wi n g exa m p I es : 
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Example II 

Genomic Clone Characterization: 

A genomic restriction map of the 15.1 kb insert from 
5 lambda-RAcI was prepared by analyzing all possible single and 
double digests with the enzymes BamHI, EcoRI, Hindlll and Sail. A 
5.3 kb Hindlll-Hindlll restriction fragment from the lambda-RAcI 
clone was subcloned into pBluescript-KS to generate the plasmid 
pRAc15'.H3. A restriction map of pRAc15'.H3 was prepared by 

1 0 analyzing all possible single and double digests with the enzymes 

BamHI, BglJI, EcoRI, EcoRV, Hindi, Hindlll, Kpnl, Pstl, Smal, Sphl, 
Xbal and Xhol. Restriction digestion, plasmid ligation, 
transformation of E. coli DH5-alpha competent cells and isolation 
of plasmid DNA were done following standard procedures. 
15 A 2.1 kb EcoRV-EcoRV region within the pRAc15'.H3 insert 

was further subcloned into pBluescript-KS and its DNA sequence 
determined according to the sequencing strategy outlined in Fig. 
2B> Double stranded DNA sequencing reactions were run using the 
dideoxy chain termination method following T7 polymerase™ 

2 0 protocol. Analysis of DNA sequence data were performed using 

the Microgenie™ computer program. 
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EXAMPLE III 

Construction of Plasmids: 

The 2.1 kb EcoRV-EcoRV fragment from pRAc15'.H3 was 
5 subcloned into the Smal site of pBluescript-KS, in both 
orientations, to produce the plasmids pRAc15'.21V and 
pRAc15'.21 V r . A promoterless B-glucuronidase (GUS) gene, 
containing the 3' noncoding region of the nopaline synthase (NOS) 
gene, was excised by BamHI-Xbal digestion of pEXAG3 and cloned 

10 between the BamHI and Xbal sites of pRAc15'.21V and 
pRAc15\21 V to produce the plasmids pRAc15'.21VG and 
pRAc15'.21 VG. Deletion of the 0.8 kb Xhol-Xhol and 1.2 kb EcoRI- 
EcoRI fragments from pRAc 15'.21VG generated the plasmids 
pRAc15M3XG and pRAc15'.09RG respectively. Introduction of the 

15 1.2 kb EcoRI-EcoRt fragment into the EcoRI site of pRAc15'.09RG, 
in the reverse orientation, produced the plasmid pRAc15'.21 VR r G. 
The plasmid pRAc15\09RGIDS _ was produced by deletion of a 0,17 
kb BamHI-BamHI fragment from pRAclS'.OSRG. 

The plasmids pRAclS'.OGRGIAS, pRAclS'.OSRGIAI and 

20 pRAc15'.09RGlA12 were generated by cleavage of pRAc15\09RG at 
its Bglll site followed by exonuclease III deletion and S1 nuclease 
treatment to remove different amounts of the RAc1 first intron. 
To construct the plasmid pRAclS'.OgRGI" the 0.9 kb EcoRI-EcoRV 
fragment from pRAc15'.H3, containing the first intron of the rice 

2 5 RAc1 gene, was cloned between the EcoRI and EcoRV sites of 
pBR322 to produce the plasmid pBRAc15'.09R. The 0.4 kb Sstl- 
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EcoRV fragment from pBRAc15\09R, containing the RAc1 intron, 
was excised and replaced with the intronless 0.1 kb Sstl- EcoRV 
fragment from the insert of an RAcI cDNA clone, pcRAd.3, to 
produce the plasmid pBRAd 5'.09RI". The intronless 0.6 kb EcoRV 

5 fragment from pBRAc15'.09RI- was excised and cloned between 
the EcoRI and Smal sites of pBS-KS to yield the plasmid 
pRAc15\09R|-. Cloning of the Gus-Nos containing BamHI-Xbal 
fragment from pEXAG3 between the BamHI and Xbal site of 
pRAc15.09RI" produced the plasmid pRAc15\09RGI-. The nature 

0 of this RAc1 promoter-GUS fusion was confirmed by sequencing 
double-stranded DNA. 
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EXAMPLE IV 

Culture, transformation and visualization of GUS activity in rice 
cell suspension cultures and protoplasts: 

5 Cell suspension cultures were generated from calli, 

initiated from scutella of mature rice (Oryza sativa L. v Lemont) 

seeds, and cultured in liquid R2 media containing 3% (w/v) 

sucrose, 2 mg/ml 2,4-dichlorophenoxyacetic acid and 2 mg/l 

vitamin B-5. All cell suspensions were subcultured weekly and 

1 0 incubated in the dark at 26°C. Cell suspensions were filtered 

through a 700 u.m mesh prior to particle gun bombardment with 

1.2 u,m diameter tungsten particles mixed with a solution 

containing 10 jig of plasmid DNA. GUS activity in intact cells and 

developing calli was determined by the appearance of blue spots 

1 5 two and ten days after bombardment with plasmid DNA following 

the GUS assay procedure of Jefferson et at. 

Protoplasts were isolated from rice (Oryza sativa L. v 
Nipponbare) cell suspension cultures and resuspended in MaMg 
medium to a final density of 10 6 protoplasts/ml. For 
20 transformation, 1 ml of protoplast suspension was incubated with 
10 u,g of circular plasmid DNA, 50 ng/ml of calf thymus carrier 
DNA and an equal volume of 30% polyethylene glycol 4000. The 
mixture was incubated for 30 minutes, diluted with CPW13 
medium, washed a further 3 times in CPW13 media, with 

2 5 centrifugation between each wash, before being resuspended in 

simplified KPR liquid media to a final density of 10 6 
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protoplasts/ml. 0.1 ml aliquots of this final suspension were 
incubated by thin layer culture in 96-well plates prior to 
subsequent assays for GUS activity. 
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EXAMPLE V 

Analysis of GUS activity in transformed protoplasts: 

To monitor the temporal pattern of GUS expression for each 

5 plasmid in transformed rice protoplasts, two wells from each 
thin layer culture were assayed 1, 3, 5, 10, 20 and 30 days after 
transformation. Cells were incubated with 5-bromo-4-chloro-3- 
indolyly glucuronide in a standard assay procedure for 48 hours 
before visualizing GUS activity. 

0 For the quantitative analysis of GUS activity, cells were 

collected 20 days after transformation by low speed 
centrifugation in a bench top microcentrifuge. Total soluble 
protein was isolated in a GUS extraction buffer. Protein extracts 
were incubated with 1 mM methylumbelliferyl-B-D-glucuronide 

5 (MUG) in a standard assay at 37°C for 3 to 6 hours. The liberation 
of 4-methylumbelliferone (MU) was followed by measuring 
fluorescence with excitation at 365 mM and emission at 455 nM 
in a spectrofluorometer. Protein concentrations of plant extracts 
were determined by the dye-binding method of Bradford. 

0 Statistical analysis of the quantitative GUS activity results were 
done using a paired t-test. 
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To begin the analysis of the rice RAc1 regulatory regions, 
restriction mapping of a lambda-EMBL4 phage clone, lambda- 
RAd , which had previously been shown to contain the rice RAc.1 
gene was carried out. The resulting restriction map of the 15.1 
5 kb lambda-RAd insert is shown in Fig. 2A. This figure also 

indicates the position of the RAc1 coding and noncoding exons, as 
previously determined. To isolate and characterize the 5'-fianking 
sequence of the rice RAc1 gene, a 5.3 kb Hindi II fragment from the 
lambda-RAd insert, spanning a region from 3.9 kb upstream of 
1 0 the RAc1 coding sequence to a point within its third coding exon, 
was isolated and cloned into the Hindlll site of the plasmid 
pBluescript-KS to produce the plasmid pRAd 5'.H3. A restriction 
map of the pRAc15',H3 insert is shown in Fig. 2B. Restriction 
maps of the lambda-RAd (A) and pRAc15\H3 (b) inserts were 

1 5 determined by single and double restriction enzyme digestion, as 

detailed in the methods. The enzyme sites are abbreviated as 
follows: BamHI, B; Bglll, G; EcoRI, E; EcoRV, R; Hindi, H2; Hindlll, 
H3; Sphl, Sp; Sstl S; Xhol, Xh. The position of the 5.3 kb Hind III 
fragment within the lambda-RAd insert is indicated by dashed 

2 0 lines between (A) and (B). Noncoding and coding portions of RAc1 

exons are depicted by open and filled boxes respectively. The 
strategy used to sequence the 2.1 kb EcoRV fragment within the 
pRAc15'.H3 insert is indicated by horizontal arrows. The length 
of the horizontal arrows are indicative of the size of the 
2 5 sequence obtained. 
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The restriction map of the pRAc15'.H3 insert shown in Fig. 

2B also indicates the strategy used to sequence an EcoRV 

fragment which covers a region 2.1 kb upstream of the translation 

initiation codon of the RAd gene. This sequence is as follows: 
5 T 

801 AQCTAGCATA CTCGAG GTCA TTCATATQCT TGA34AGA3A Xho1 

GTCGGGATAG TCCAAAATAA AACAAW3GTA AGMTACCTG 

881 GTCAAAAGTG AAAACATCA3 TTAAAAGGTCi GTATAAGTAA 

AATATCGGTA ATAAAAGGTG GXCAAAGTTG AAATTTACTC 

1 0 961 TTTTCTACTA TTATAAAAAT TGAGGAIGTT TTGTCGGTAC 

TTTGATACGT CATTTTTGfrA TGAAT TGGTT TTTAAGTTTA i 

1041 TTCGOGATTT TGGAAAIGCA TATCTGTATT TGAGTCQGGT i 

TTTAAGTT QG TTTGCTTTTG TAAATACAGA GGGATTTGTA 

1121 TAAGAAATAT CTTTAAAAAA AOOCATATGC TAATTTGACA 

1 5 TAATTTTTGA GAAAAATATA TATTCAGG CG AATTCTCACA Eco Rl 

1201 ATGAAGAATA ATAAGATTAA AATAQCTTOC OOCCGTTOCA 

GCGATGGGTA 1 1 1 1 I ICTAG TAAAATAAAA GATAAACTTA 

1281 GACTCAAAAC ATTTACAAAA ACAA(XCCTA A^fTCCTAAA " 

GCCCAAAGTG OTATCCflCGA TCCATAGCAA GOQCA300CA Hi 

2 0 1361 ACCCAAOCCA ACraWCCCft CXXXVVglGCA GCCAftCTGGC 

AAATAGTCTC Cfl3CO00GG CACTATCACC GTGAGTTGTC 

1441 OGCPCCflCCG GAGGTCTOGC AGC CAAAAAA AAAAAAAGAA iv 

AGAAAAAAAA GAAAAAGAAA AAACAGGAQ 3 TCGGTO0G3G 

1521 TOGTG33GGO CG3<WVSG0G AGGOGGATOG OGfiGGAGOGA 

25 OGAGGGCGGO CCTO0CTO0G CTTCCAAAGA A43G0C000C 

1601 ATCGCCA CTA TATACA TACC OCOOCCTCTC CTCCCATCCC v 

(XCAACC CT4 CCAOCACCAC CACCfiCCACC TCCTOO OOOC vi 

1681 TCGCTGCC GG ACGACGAGCT CCTOO CCCCT COCOCTCOGC Sst I 

CGCCGCCGGT AAOCAO00CG CCCCTCTCCr CTTTCTTTCT 

3 0 17 61 CCGI 1 1 1 1 1 1 TTTCGTCTCG GTCTCGATCT TTGGCCTTGG 

TAGnTGGGT GGQOGflGOGC G3CI IOGICG OOCAGATOGG 

1841 TOOGOOOGflG GQQDQ93MC TOGCGGCTGG OGIUICOGGG 

0GTGA3IO3G CCC GGATOCT 0GDGQG3W G3GGGIC1U3 Bam HI 
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1921 GATGT AggEG HCTTTCTTT CTTCTTTTTG TGGGTAGAAT Bglli 

TTGAATCCCT CAQCATTGTT CATCGGTAGT TTTTCTTTTC 
2001 ATfiATT TGTG AC AAATQGAG CCTOGTODGG AGCI 1 1 1 1 IG vii 

TAG GTAGAAG ATG GCT GAC GCC GAG GATATC Eco RV 

5 Met Ala Asp Ala Glu Asp He 

In this sequence, nucleotides are numbered with the A of the 
RAcI transcription initiation site designated in bold. Restriction 
sites used in the subsequent construction of the various RAc1- 
GUS fusion plasmids and structural regions described in detail are 

1 0 underlined and named or designated by lower case Roman 

numerals and noted at the right margin of the sequence table. 
Upper case capital letters represent 5'-flanking sequences, upper 
case italic letters represent exon sequences and upper ease 
capital letters between the two sets of exon sequences represent 

15 the intron sequence. The codons of the RAct first coding exon 
have their translation product indicated below them. 

A number of potential regulatory sequences were identified 
in the 5'-flanking region of RAc1 gene. A 12 bp direct repeat, 
G GTTTTA AGTT (region i), is located between bases 1027 to 1038 

20 and 1078 to 1088. A tandem (imperfect) direct repeat of 16 bp, 
AA(G/C)CCC(T)AAAGT(G/C)CTA (region ii), is located between 
bases 1301 and 1333. 20 bp downstream of this tandem direct 
repeat are eight tandem copies of an imperfectly repeating 
pentamer with the consensus sequence CCCAA (region Hi). 

2 5 Finally, between bases 1465 and 1505, there is a purine rich 

sequence where 35 out of 40 bases are "A"s (region iv) 
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Downstream of these putative regulatory regions a number 
of sequences have been identified which have previously been 
implicated in the control of gene expression. Between bases 1609 
and 1617 there is a putative TATA box (region v). The sequence 
5 and position of the putative RAc1 TATA box are in agreement with 
that previously determined for a number of other plant genes, and 
the sequence around the transcription initiation site; CCTACCA is 
similar to the consensus sequence for transcription initiation 
previously determined for a number of animal YAYY and plant 

1 0 YYYAYYA genes (Y= pyrimidine). The noncoding exon located 3' of 

the TATA box is GC rich (77.5%) and consists of a number of 
tandemly repeated (A/T)CC triplets (region vi). 

The 5*-intron of the rice RAcI gene is 313 bp long. Its 5'- 
acceptor splice site [(G/G)TA], and 3'-donor splice site 
15 [TTTTTTGTA(G/G)] f follow the consensus sequence previously 
determined for a number of plant genes. A putative branch point 
site for mRNA splicing between bases 2008 and 2012 (region vii) 
was identified whose sequence, GTGAC, and distance from the 3- 
donor splice site bears similarity to the location and consensus 

2 0 sequence for animal mRNA branch point splice sites, YTR AC. 

Four RAc1 -GUS fusion plasmids were constructed to 
determine the minimum amount of RAc1 5'-flanking sequence 
required for maximal B-glucuronidase (GUS) gene expression; 
these are shown in Fig. 3. The plasmid pRAd 5\21 VG has the 2.1 
2 5 kb EcoRV restriction fragment from pRAc15'.H3 fused to the GUS 
coding region and S'-noncoding transcript terminator region of the 
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nopaline synthase (NOS) gene. This plasmid encodes a transcript 
containing two in frame translation start codons, one each from 
the RAc1 and GUS genes, adding 15 amino acids to the N-terminal 
end of the wild type GUS protein. The plasmids pRAd 5M3XG and 
5 pRAc15'.09RG were created by the deletion of 0.8 and 1.2 kb of 
sequence, respectively, from the 5' end of pRAc15'.21 VG. The 
plasmid pRAd 5'.21 VR r G has the 5* 1.2 kb EcoRI fragment cloned 
in the opposite orientation to that in pRAc15'.21 VG. 

A number of plasmids were also constructed to investigate 

10 the importance of the RAc1 5'-intron on RAc1 -promoter-mediated 
gene expression. The plasmid pRAc15'.09RGIA8 contains a 
deletion of 9 bp around the Bglll site of the RAc1 S'-intron. The 
plasmid pRAc15*,09RGIA1 contains a deletion of 133 bp between 
bases 1868 and 2003 of the sequence depicted above, reducing the 

1 5 distance between the 5' acceptor and mRNA branch point from its 
S'-intron. The plasmid pRAc15'.09RGIA12 has a deletion of 157 bp 
between bases 1868 and 2027, removing the putative mRNA 
branch point from its S'-intron. The plasmid pRAc15'.09RGIDS- 
was created by excision of a 170 bp BamHI fragment from 

2 0 pRAc15\09RG, removing the putative mRNA branch point and 3'- 
donor splicing sites from the RAc1 5'-intron. The plasmid 
pRAc15'.09RGI- is identical to pRAc15'.09RG except that it lacks 
the entire RAc1 S'-intron. 

The plasmid pRAc15'.21 VG contains the 2.1 kb EcoRV 

2 5 fragment from pRAd 5'.H3 cloned in the opposite orientation to 
that in pRAc15'.21 VG. It was postulated that the RAcI 5*-region 
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would not show bidirectional promoter activity because the 5'- 
most 1.2 kb EcoRV-EcoRI restriction fragment in pRAc15.21VG did » 
not show any binding when used as a probe in Northern 
hybridization against total RNA from 7 day old rice shoots. As a 
5 positive control, the plasmid pAhGusN was used. This plasmid 
contains the promoter, first exon and first intron of the maize 
Adh1 gene fused to a GUS coding sequence with the 3' noncoding 
region of the NOS gene. 

More specifically, the individual maps of constructs 
1 0 containing various portions of the 5'-flanking and 5'-transcribed 
sequence of the rice RAcI gene fused, in frame, to a sequence 
containing the GUS coding region and 3' NOS transcription 
terminator are shown in Figure 3. Open boxes represent noncoding 
exons of the RAd gene, filled boxed represent the GUS coding 

1 5 region (not to scale) and striped boxes represent the NOS 

terminator sequence. The stippled box in the construct pAhGusN 
represents the first exon of the maize Adh1 gene. The "A" symbol 
indicates the deletion point in the various RAc1 first intron 
deletion constructs. The indicated restriction enzyme sites are 
20 abbreviated as follows: BamHI, B1; Bglll, B2; EcoRI, R1; EcoRV, 
RV; Hindi, H2; Hindlll, H3; Sstl, S1; Xhol, Xh1; Xbal, Xb1. 

To assay GUS expression from the various RAcl-GUS fusion . 
plasmids; rice suspension culture cells were transformed by the 
biolistic method [see Wang, Y.C. et al., Transient expression of 

2 5 foreign genes in rice, wheat, and soybean cells following particle 

bombardment. Plant Mol. Biol. 11:433 (1987); and Cao, J. et al., 
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Transformation of rice and maize using the biolistic process. 
UCLA Symposia on Plant Gene Transfer (1989)], and assayed for 
high level GUS activity by visual inspection 10 days after 
transformation according to Jefferson et al. [GUS fusions: 6- 
5 glucuronidase as a sensitive and versatile gene fusion marker in 
higher plants. EMBO J. 6:3901 (1987)]. 

The results of these various assays are tabulated in the 
following table: 



TABLE I 



Protoplasts: 



Cell Suspension 



mean specific 
Activity 

Names nmol/hour/u.g 



meanGUS 
Expression 



Visible GUS 
Activity 



pRAc15\21VG 21 1 .41 

pRAc15M3XG 195.0"! 

pRAc15'.09RG 94.22 

pRAc15\21VRrG 92.52 



8.4 

7.9 

5.7 

6.1 

2.8 

1.4 

0.2 

0.2 

0.2 

0.0 

0.0 

4.2 



+ 



2 0 pRAc15'.09RGIA8 95.42 

pRAc15'.09RGIA1 53.43 

pRAc15\09RGIA12 17.0 4 
pRAc15'.09RGIDS- 8.85 

pRAc15'.09RGI- 4.75 



N.A. 
N.A. 
N.A. 
N.A. 



25 pRAc15'.21VrG 0.75 
Untransformed 0.05 
pAHGusN 31.96 



+ 



N.A. = not assayed 
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The results of the rapid assay are shown in the third column 
of Table 1. Suspension cultures transformed with the plasmids 
pRAc15 , .21VG > pRAc15M3XG, pRAc15'.09RG, pRAc15'.21VRrG and 
pAhGusN were all positive for GUS activity while untransformed 
5 cultures or those transformed with pRAc15'.09RGI- or 
pRAc15'.21 V r G displayed no visible GUS expression. 

Table I also shows the results of a determination of GUS 
expression efficiencies, defined as the percent of intact cells 
displaying visible GUS activity 10 days after PEG-mediated 

10 transformation of rice protoplasts with the RAc1-GUS fusion 
constructs. The plasmids pRAc15'21 VG and pRAc15M3XG 
displayed the highest GUS expression efficiencies at 8.4 and 7.9% 
respectively. The plasmids pRac15*.09RG and pRAc15'.21 VR r G 
showed GUS expression efficiencies of 5.7 and 6.1%, respectively. 

1 5 All four of the aforementioned plasmids displayed significantly 
higher GUS expression efficiencies than was found for pAI-|GusN. 

Removal of RAc1 5'-intron sequences resulted in a significant 
reduction in the number of cells displaying visible GUS activity, 
relative to pRAc15'.09RG t which contains the intact RAc1 5'- 

20 intron. For plasmids pRAc15\09RGIA8 (2.8%) and 

pRAc15'.09RGIA1 (1.4%), the longer the sequence deleted between 
the 5'-acceptor and the putative mRNA branch point splice sites, 
the lower the GUS expression efficiency. Removal of the branch 
point for mRNA splicing in the plasmid pRAcl5'.09RGIA12 and the 

25 additional removal of the 3'-donor splice site in the plasmid 
pRAc15'.09RGIDS- led to GUS expression efficiencies which, at 
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0.2%, represents less than 4% of that determined for the plasmid 
pRAc15'.09RG. Untransformed protoplasts or protoplasts 
transformed with either pRAc15*.21VG or the intronless plasmid 
pRAc15'.09RGI- gave no blue cells. 
5 To determine the quantitative differences in GUS expression 

from the various RAc1-GUS fusion plasmids, the constructs were 
introduced into rice protoplasts by PEG-mediated transformation, 
and GUS specific activity was assayed 20 days after the 
transformation procedure. The results of this quantitative 

1 0 analysis of GUS expression are presented in the first column of 

Table I. 

The highest GUS specific activities were recorded for the 
plasmids pRAc15'.21VG and pRAc15M3XG, which have the longest 
RAc1 upstream regions. The plasmid pAHGusN, which has the 
15 GUS gene expresses from the maize Adh1 promoter, displayed less 
than 17% of the GU specific activity shown by pRAc15\21VG and 
pRAc15M3XG. The plasmids pRAc15\09RG, with a truncated RAc1 
5'-region, and pRAc15'.21 VR r G, with an inverted RAcI ^-region, 
were found to have GUS specific activities around 48% of that for 

2 0 pRAc15'.21 VG. There was no statistically significant difference 

(mean specific activity with identical superscripted numbers are 
not statistically different from each other), in specific activity 
between these two constructs, suggesting that the inversion of 
the RAc1 5'-end did not influence GUS expression from the 
2 5 truncated RAd 5'. 09RG construct. 
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The effects of the various RAc1 intron deletions could be 
seen when their GUS specific activities were compared to that of 
their progenitor plasmid, pRAc15\09RG. The small 9 bp deletion 
in the RAc1 S'-intron of plasmid pRAc15'.09RGIA8 had no effect 
5 on the quantitative level of GUS activity. The plasmid 

pRAc15 a .09RGlAl, containing a deletion of 133 bp between the 5'- 
acceptor splice site and the putative mRNA branch point splice 
site, displayed a GUS specific activity that was less than 57% of 
that observed for pRAc15*.09RG. The plasmid pRAclS'.OgRGIAia, 

1 0 which had the putative mRNA branch point site removed, showed a 

further decrease in GUS specific activity of 17% of that observed 
for pRAc15'.09RG. The plasmid pRAc15\09RGIDS-, which lacks 
the putative mRNA splicing branch site and 3'-splicing donor site 
of the RAc1 S'-intron, and pRAc15'.09RGI-, which lacks the entire 
1 5 RAc1 S'-intron, displayed no significant GUS specific activities 
over that observed for untransformed samples. However, it was 
noted in Table I that while biue cells, indicative of GUS 
expression, were never observed in untransformed rice 
protoplasts or protoplasts transformed with the intron less 

2 0 plasmid pRAclS'.ogRGI-, GUS expression was able to be visualized 

in those rice protoplast transformed with the plasmid 
pRAC15',09RGIDS-. 

As a first step towards an understanding of the regulation 
of rice actin gene expression, the present invention has described 
25 the isolation and characterization of the 5'-regions of the rice 
RAc1 gene. Within the 5'-flanking sequence, the occurrence of a 
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long poly(dA) stretch located between bases 1465 and 1505 which 
appears to play a role In the constitutive activation of RAc1 gene 
expression was found. Based upon chemical analysis, it has been 
suggested that a minimal affinity for histone cores and 
5 nucleosome formation is provided by homogeneous tracts of 
purines located on one strand of the double helix. Poly(dA) 
regions within recombinant DNA molecules can prevent 
nucleosome formation in vitro, and it has also been found that 
naturally occurring poly(dA) tracts act as 5'-promoter elements 
10 for the constitutive expression of different yeast genes. It is 
believed that poly(dA) stretches may change the chromatin 
structure, enabling general transcription factors to access the 
DNA template and activate constitutive transcription in the 
absence of more specific transcription factors. 

1 5 The sequence of RAc1 untranslated mRNA 5'-region is 

unusually GC rich, with an AT content of only 26.5%. In a survey 
of 79 plant genes it was found that their mRNA 5'-sequences were 
extremely AT rich, with 71 of the genes surveyed having an AT 
content greater than 51% and only one of the genes surveyed 

2 0 having an AT content of less than 44%. In the ribosomal-scanning 

model of translation initiation it is proposed that the AT richness 
of the mRNA 5*-sequence leads to the formation of relatively 
labile secondary structures which facilitate the movement of the 
ribosomal subunits towards the translation initiation codon. The 
25 significance of the GC rich RAc1 mRNA 5'-sequence, with respect 
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to transcript stability and translation efficiency, remains to be 
investigated. 

The construction of a number of RAc1 -GUS fusion 
constructs has provided the determination that the plasmid 
5 pRAd 5M3XG, containing 0.83 kb of 5*-flanking sequence, the 
noncoding exon and 5'-intron of the RAc1 gene, has the minimal 
amount of RAc1 sequence necessary for maximal GUS expression 
in transient assays of transformed rice protoplasts; the 
additional 0.8 kb of RAc1 5'-sequence presents in the 
10 pRAc15'.13VG did not significantly increase GUS expression above 
that of the plasmid pRAc15'.13XG. It was also found in the 
present invention that the RAc1 5'-flanking sequence was more 
active than the maize Adhl 5'-flanking region in stimulating GUS 
expression in transformed rice protoplasts. Deletion of a 0.4 kb 
15 region from the 5*-end of the pRAc15'.13XG insert resulted in a 
52% reduction in GUS activity. However, no statistically 
significant difference in GUS specific activities was detected 
between the plasmids pRAC15\09RG, with the 0.4 kb region 
deleted, and pRAc15'.13.09RG, with the 0.4 kb deleted, and 
20 pRAc15'.21VRrG, with the 0.4 kb region present but inverted and 
displaced 0.8 kb further upstream. This suggests that the 
sequence element(s) in the 5'-end of the pRAc15M3XG insert that 
are responsible for high level GUS expression do not display any 
position/orientation-independent, enhancer-like activity. In most 
2 5 cases the differences in GUS specific activities detected between 
the RAc1-GUS fusion constructs could be directly correlated with 
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their differential GUS expression efficiencies. This suggests that 
there is a threshold to the visualization of GUS expression. 
Protoplasts transformed with the various R Act -GUS fusion 
constructs are presumed to display a position-effect-dependent 
5 distribution of GUS specific activities, the mean of which is a 
function of their particular RAc1 promoter. Therefore those 
protoplast populations transformed with the most active RAc1 
promoter-GUS fusion constricts will display the greatest number 
of visually detectable blue cells and the highest GUS expression 

10 efficiencies. 

By constructing RAc1 -intron-deletion-GUS plasmids, it was 
able to be shown that GUS expression in transformed rice 
protoplasts was dependent on the presence of an intact RAc1 5'- 
intron. Deletion of the RAc1 intron reduced GUS specific activity 

1 5 to levels that were not significantly greater than that of 

untransformed protoplasts. In transient assays of transformed 
maize protoplasts a maize Adh1-S gene lacking the nine Adh1-S 
introns was expressed at levels 50- to 100-times lower than that 
of the intact gene. The reintroduction of the S'-intron of the 

20 Adh1-S gene was enough to restore the level of expression to that 
observed for the intact gene, although this effect was only 
observed if the first intron was reinserted in a 5'-position; it is 
believed that this was a result of increases in the amount of 
mature, cytoplasmic mRNA and not a result of increased 

2 5 transcript stability in the nucleus or increased translation 
efficiency in the cytoplasm. 
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The increase in expression resulting from the presence of 
RAc1 5'-intron is not believed to be caused by the presence of 
some enhancer-like cis acting sequence within the intron. No 
sequences within the RAcl intron were found which bore any 

5 homology to either an enhancer-like sequence from the first 
intron of the human B-cytoplasmic actin gene, or to any 
sequences within the maize Adh1 5'-intron. Nor is it believed 
that the RAc1 first intron codes for any functionally active 
transcript or protein product. Previous Northern blot 

0 hybridization with a double stranded probe that spans the RAc1 
first intron failed to show any binding to total RNA from seven 
day old rice shoots, other than to the 1.7 kb RAc1 transcript 
itself. Finally, the RAc1 5'-intron contains no open reading 
frames of any significant length. One conclusion of the present 

5 invention suggests that the primary effect of RAc1 first intron is 
associated with an in vivo requirement for efficient splicing. 

The small deletion in the RAc1 intron in the plasmid 
pRAd 5'.09RGIA8 caused no significant reduction in GUS specific 
activity relative to that of pRAc15'.09RG. However, a significant 

0 difference was observed between the GUS specific activities of 
the plasmids pRAc15\09RG and pRAc15'.09RGIA1. The 133 bp 
deletion in the plasmid pRAc15'.09RGlA1 , while not removing any 
of the sequences previously implicated in intron splicing, did 
cause a reduction in the distance between the 5'-acceptor and 

5 putative mRNA branch point splice sites with an associated 44% 
reduction in GUS specific activity. Large reductions in splicing 



WO 91 /09948 PCT/US91 /00073 

37 



efficiency occur in vivo and in vitro when the distance between 
the S'-acceptor and branch point splice sites is reduced; this 
requirement for a minimal distance between the 5'-acceptor and 
branch point splice sites probably reflects a requirement for 
5 multiple splicing factors to interact with specific intron regions, 
these regions being presumably removed in the construction of 
pRAc15\09RGlA1. The in vivo requirement for efficient intron 
splicing was further supported by the observation that a deletion 
which removed the putative 5'-intron mRNA branch point splice 

10 site in the construction of pRAd 5\09RGIA12, or removed both 
the branch point and 3'-donor splice sites in the construction of 
pRAc15'.09RlDS- led to GUS specific activities which were 18% 
and 9% of that observed for pRAc15\09RG. Deletion of the normal 
branch point and S'-donor splice sites of animal genes does not 

15 abolish splicing but rather leads to the use of cryptic branch 
point and S'-donor splice sites, with associated reductions in 
splicing efficiency and gene expression; this may be occurring in 
rice protoplasts transformed with the plasmids 
pRAc15'.09RGlA12 and pRAc15'.09RGIDS", leading to reduced but 

2 0 still detectable GUS expression 

The results of the present invention clearly demonstrate 
that the RAcI S'-flanking sequence between 0 and 2070 
nucleotides and more particularly between 800 and 2070 
nucleotides in the genomic sequence contains an efficient 

2 5 promoter for rice transformation. Furthermore, the expression of 
a foreign gene in transformed plant cells can be dependent upon, 
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but not necessarily an absolute function of, the presence of an 
intact 5'-intron sequence. It appears that a functional 
requirement for the presence of the 5*-intron may be correlated 
with the conservation of the 5* noncoding exon during this rice 
5 actin gene's structural evolution. In addition the results of the 
present invention indicate that the maize S'-intron is located 
between the first and second coding exons of the Adh1 gene, while 
the rice S'-intron is located between a 5' noncoding exon and the 
first coding exon of the RAd gene, suggesting that there may be a 

1 0 common positional component to the intron mediated stimulation 
of gene expression observed for the maize and rice 5-intron. 

In summary, the preceding description of the present 
invention clearly demonstrates that a 2.1 kbp 5' of the Act1 
gene's translation initiation codon, containing 1.3 kb of 5* 

15 untranscribed sequence, the 5' transcribed but untranslated exon, 
5*-intron and part of the first coding exon of the rice Act1 gene, 
is capable of conferring high level expression of foreign gene in 
transformed rice material. Thus this region can be used to 
activate the constitutive expression of foreign genes in 

20 transgenic plants of rice and other agronomically important 
plants; the 5*-intron of the rice Act1 gene can stimulate the 
expression of a foreign gene in transformed rice material [thus 
this (and the other introns of the rice Act1 gene) will be able to 
increase the expression of foreign genes in transformed plants of 

2 5 rice (and other agronomically important plants) when ligated in 
front of such a gene in between a promoter and the gene]; and 
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although no specific function for the 3'-end of the rice Act1 gene 
in the regulation of foreign gene expression in rice has been 
demonstrated. It appears from the present invention that the 3'- 
end of the rice Act1 gene should also stimulate the expression of 
5 such foreign genes in transformed plants of rice and other 
agronomically important crops. 

Thus, while we have illustrated and described the preferred 
embodiment of our invention, it is to be understood that this 
invention is capable of variation and modification and we 
1 0 therefore do not wish or intend ourselves to be limited to the 
precise terms set forth, but desire to avail ourselves of such 
changes and alterations which may be made for adapting the 
invention to various usages and conditions. Accordingly, such 
changes and alterations are properly intended to be within the 

1 5 full range of equivalents, and therefore within the purview of the 

following claims. 

Having thus described our invention and the manner and a 
process of making and using it in such clear, full, concise and 
exact terms so as to enable any person skilled in the art to which 

2 0 it pertains, or with which it is mostly nearly connected, to make 

and use the same; 
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WE CLAIM: 

1. A promoter for monocotyledonous plants including a 
DNA sequence comprising a 5-region of a rice RAc1 gene that 
directs the expression of a linked structural gene in the tissues 
of a recipient plant. 

2. A promoter for monocotyledonous plants according to 
Claim 1 comprising the region extending from approximately 
position 1 to approximately position 2070 of the RAc1 gene. 

3. A promoter for monocotyledonous plants according to 
Claim 2 containing the region extending from approximately 
position 800 to approximately position 2070 of the RAc1 gene. 

4. A recombinant monocotyledonous plant comprising a 
structural gene for the production of a plant produced product 
linked with a promoter comprising a 5-region of a rice RAcl gene 
that directs the expression of the linked structural gene. 

5. A genetic construct selected from the group 
consisting of pRAc15'.21 VG, pRAc15\13XG, pRAc15'.09RG, 
pRAc15\21VRrG, pRAc15'.09RGIA8, and pRAc15\09RGIA1. 

6. A regulatory element for RAc1 gene promoter activity 
comprising approximately a 1.3 kb genetic sequence located in the 
S'-region upstream portion of the RAc1 translation initiation 
codon. 

7. A genetic construct comprising the RAcl 5' intron 
ligated in between a suitable promoter and a structural gene to 
enhance the expression of the latter. 
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8, A genomic sequence comprising the RAc1 sequence 
including a 5' non-coding exon separated by a S'-intron from the 
first coding exon in the sequence. 

9. A genomic sequence comprising a fragment having a 
restriction map selected from: 
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